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S"^ Abstract 

In this paper, we study the distribution of distances in random Apollonian network structures 
(RANS), a family of graphs which has a one-to-one correspondence with planar ternary trees. 
Using multivariate generating functions that express all information on distances, and singularity 
analysis for evaluating the coefficients of these functions, we describe the distribution of distances 
i | to an outermost vertex, and show that the average value of the distance between any pair of 

vertices in a RANS of order n is asymptotically \fn. 

u 

^ 1 Introduction 

Many graph models have been recently introduced for representing the structure and dynamics of real- 
i ^ i life networks (see e.g. [3]). Their adequacy to data can be measured by comparing some properties of 

graphs, especially the degree distribution of the vertices, which is related to scale-free properties, and 
properties related to the "small world" effect, such as distance between pairs of vertices and grouping 
in clusters. 

The random Apollonian networks (RAN) proposed by [3] provides a very interesting model, with a 
power-law degree distribution, a mean distance of logarithmic order and a large clustering coefficient. 
We introduced in Q a modified version, random Apollonian network structures (RANS), which pre- 
serves the interesting properties of real graphs concerning degree distribution (a power-law with an 
exponential cut-off) and large clustering. This paper is devoted to the analysis of distances in RANS, 
which is showed to be of square root order: we first characterize the distances from one special vertex 
to all the other vertices of the graph, and then work on the distances between pairs of vertices. 

A RANS can be seen as a certain type of triangulation of a triangle, and the study of RANS 
relies on the bijection with planar ternary trees (see figure [TJ. From this bijection we can express the 
cnumerative generating function for RANS, and use multivariate functions for marking several distance 
parameters. Moreover the asymptotic values of the quantities under consideration can be dealt with 
using singularity analysis (according to methods developed in [2]). 

We are interested in two types of parameters measuring distance, and develop two methods to 
handle them. We first attack the distances between a special vertex (an outermost vertex A of the 
RANS) and all the other vertices. The method is built on computing a generating function with 
infinitely many variables, that contains all informations concerning distances from A to the other 
vertices. Distribution analysis is based on the study of partial derivatives of this multivariate series, 
which correspond to the series counting the number of vertices at a certain distance from A. These series 
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all express in terms of the generating function for RANS and asymptotic analysis gives a distribution 
with a mean value of order y/iwri/W. 

The second study addresses the total distance between all pairs of vertices. We exhibit a generating 
function in four variables that expresses simultaneously distances from one, two or three outermost 
vertices. This generating function has a nice recursive definition, due to the symmetries of the prob- 
lem. It contains all information to compute the total distance between pairs of vertices. Geometrical 
considerations splits this total distance in two parts, depending on whether a path between two vertices 
spans over disjoints sub-RANS or not. The resulting mean distance between two vertices is of order 
2v / 3™/ll. 

This paper divides in four sections: this introduction, followed by a section that recalls the definition 
of random Apollonian network structures, the bijection with ternary trees, and the result for degree 
distribution. Section 3 describes the distribution of distances from an outermost vertex and section 4 
is dedicated to the study of the total distance between all pairs of vertices. 

2 Random Apollonian network structures 

The recursive definition of RANS shows a one-to-one correspondence with ternary trees. The degree 
distribution, which is a power law with an exponential cut-off, was studied in [1] by considering bivariate 
series marking the corresponding parameter in trees. 

2.1 Bijection with ternary trees 

A random Apollonian network structure (RANS) R is recursively defined as: either an empty triangle 
or a triangle T split in three parts, by placing a vertex v inside T and connect it to the three vertices 
of the triangle; each sub-triangle being substituted by a RANS (see figure [T]) . 

The vertices of T will be called the outermost vertices of R (noted O(R)); and vertex v will be 
called the center of R. We will note TZ the class of all RANS. 

The order of the empty RANS is zero and the order of a non-empty RANS is one plus the sum of 
the orders of the three sub-RANS . 

Proposition 2.1. J7]/ There is a bijection between random Apollonian network structures of order N 
and rooted plane ternary trees of size N (with N internal nodes). 

In planar ternary trees, the linear ordering of siblings is relevant. This order is carried over to 
triangles: naming {Oi, 02,03} the vertices of O(R), imposes a linear ordering on the sub-RANS 
({Si, S2, S3} = S(R)): Si will be the one not containing Oi. Recursively replacing the missing outer- 
most vertex by the center of R preserves the order in sub-RANS. 

The generating function for ternary trees T(z) = ^ T^z N satisfies the functional equation T(z) = 
1 + zT 3 (z), whose solution can be analysed locally through singularity analysis. T(z) has radius of 
convergence p — 4/27 and singular value r = 3/2; and the singular expansion of T(z) near p is 

T{z) = I - ^^zTp+ |(1 - zip) - ^(1 - z/p) 3/2 + O ((1 - z/pf'>) . (1) 

Thus the asymptotic form of the coefficients: T/v ~ cp~ N N~ 3 / 2 , with c = 

The derivative T (z) = ^^p^ will also appear in the computations below. The leading term in 
its singular expansion is ^(1 — z/p)^ 1 / 2 , thus a coefficient T' N of asymptotic order 27 "^ p~ N N^ 1 / 2 . 
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Figure 1: A random Apollonian network and its corresponding ternary tree 



2.2 Degree distribution 

The degree distribution in random Apollonian network structures follows a power law with an expo- 
nential cutoff. This is obtained by analysing the degree of the center of a RANS (which corresponds 
to the size of a binary subtree at the root of the corresponding ternary tree), and propagating this 
study to the whole of the sub- RANS. 

The bivariate generating function marking the degree of the center is D g (z, u) — zu 3 T 3 (z, u), where 
T(z, u) is the bivariate generating function for ternary trees with u marking the size of the underlying 
binary subtree, which is also the degree of an outermost vertex: 

T(z,u) = 1 + uzT(z)T 2 (z,u). (2) 

Theorem 2.2. fj]/ The degree distribution in random Apollonian network structures follows a power 
law with an exponential cutoff: Pr(D g — k) ~ C f3 k k~ 3 / 2 , with [3 = |. 
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Figure 2: RANS of type (0, 1, 1), (0, 0, 1), (0, 0, 0) and their sub-RANS. 



3 Distance from an outermost vertex 

This section is devoted to computing the distribution of the distances from a fixed specific outermost 
vertex. We introduce a generating function with infinitely many variables, each variable Ui marking 
the number of vertices at distance i from the outermost vertex. Relying on the symmetries of the 
problem and the recursive nature of RANS, we are able to express and study this generating function. 

The interest of this analysis is not only to find, with a different method, some results of the 
following section; but moreover this study can be adapted to compute the distribution of distances to 
any distinguished vertex in a RANS, which may be considered as a more realistic parameter. 

3.1 Multivariate generating function 

Due to the recursive nature of RANS, we often have to consider a RANS R as having an environment, 
that is a bigger RANS containing R. Given a RANS R, the distance of any of its vertices to a vertex 
v in the environment of R is determined by the three distances of the elements of O(R) to v. 

Since the outermost vertices of R form a clique, their distances to any vertex cannot differ by more 
than one. This observation allows us to reduce our study to a few cases. First we work modulo a 
translation and restrict ourselves to the case when the three distances to O(R) are either 1 or 0. Second 
we can work modulo a permutation and restrict ourselves to only three cases: (0, 1,1), (0, 0, 1) and 
(0,0,0), illustrated in figure [2j These three cases actually correspond to labelling the internal vertices 
of R by their distances either to one (out of three), or to two (out of three) or to all three outermost 
vertices. 

Definition The 8 (1) -labeling of R € TZ consists in putting on each vertex a label corresponding to its 
distance from Oi(R) (or equivalently to one of any Oi(R)): 

• the outermost vertices 0\,Oi,0-s respectively receive labels 0, 1 and 1; 

• the center of R! € S(R) is labeled by 1 plus the minimum of the labels of O(R'). 

Definition Let's define the type of R £ 1Z as the set of labels of O(R). We say that 

• two RANS types are equivalent iff they have the same type up to a permutation. 

• two RANS types are translated by 9 iff their labcllings are the same up to a translation 0. 

In a RANS R of type (0, 1, 1) the center gets label 1. Thus R is equivalent to ^(i?) and Sz{R), 
but it is not equivalent to S\(R), which is of type (1, 1, 1): if a vertex is at distance d from Oi(Si(R)), 
its distance from Oi(R) is d+ 1. 

This remark leads to the bivariate generating function for RANS marked with vertices at distance 
1 from Ot(R): 

Ti(z,m) = 1 + zu\Tl(z,u\)T{z). (3) 
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This follows the recursive definition of RANS, noticing first that the center is at distance 1 from 0\(R) 
and second that the configuration is the same in 52 (R) and Ss(R), whereas in S\(R) there is no vertex 
at distance 1 from 0\{R). Note that (J3j is obviously the same generating function as the degree 
of a vertex being exactly the number of vertices at distance 1 from this vertex. 

The problem of marking both vertices at distance 1 and 2 from 0\(R) is treated in the same way: 
the configuration of R, of type (0, 1, 1), recursively occurs in S 2 (R) and Ss(R), which are of the same 
type. But the case of Si(R), of type (1, 1, 1), is a little more tricky and requires a deeper decomposition. 
If Si(R) is not empty, its center is at distance 2 from Oi(R), and its three sub-RANS 5(5*1 (R)) are 
equivalent, of type (1,1,2): either they are empty, or their center is at distance 2 from Oi(R), one 
sub-RANS is of type (1, 1, 2), equivalent to S(Si(R)) and the two others are of type (1, 2, 2), translated 
by 1 with R, which means that the number of their vertices at distance 2 from Oi(R) in is the same 
as the number of vertices at distance 1 from 0\(R) in R. 

This decomposition leads to the following functional equations, with Uj marking vertices at distance 
j from Oi(i?): 

T 2 (z,u\,u 2 ) = 1 + zuiT2(z,ui,u 2 )F(z : ui,u 2 ) 
F(z,u 1 ,u 2 ) — 1 + zu 2 G 3 (z, «i, u 2 ), 
G(z,ui,u 2 ) = 1+ zu 2 G(z,ui,u 2 )T?(z,u 2 ). 

It is easy to show that the same equations hold for d > 3, when considering multivariate generating 
functions T(z, u\,u 2 , ■ ■ • , u d ), with Uj marking vertices at distance j from 0\(R), and we get the 
following result. 

Proposition 3.1. Let r nj ki,...,ki be the number of RANS of order n with kj vertices at distance j from 
0\. Then rn.fci,...,*^ * s the coefficient of u^ 2 ■ ■ ■ u^ d z n in the multivariate series Td(z, «i, . . . ,Ud), 
where the series satisfy the recurrence relations: 

Ti(z, ui) — 1 + zuiTi(z, ui)Tq(z) with Tq(z)=T(z) 

and for d>2, 



T d {z,U!, ...,u d )=l + zuiT d (z,ui, . ..,u d ) 1 + zu 2 



(1 - zu 2 Tl_ x {z, u 2 ,...,u d -i)y 



The sequence T d (z, ui, . . . , u d ) converges to a function T , o(^, u\, . . . , Uj, . . .) which contains all in- 
formation concerning distances from vertex 0±: 

• The cnumerative series for the number of vertices at distance i from 0\, over all RANS, is 



d 

Di(z)= —Too (2, Ui,..., Ui,...) 
OUi 



— ^ hi r n ] Zi ; 



• The asymptotic of the total distance from 0\ expresses as 

d 



du D(z,u) 



oo 



, where D(z,u) — > J Dj(z)u' 1 
u=l i=1 



The aim of the next paragraph is to evaluate these quantities. 
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3.2 Distribution analysis 



Generating functions counting the number of vertices at distance i from 0\ express as rational functions 
in z and T(z), and have a singular behaviour similar to T{z): radius of convergence p, and singular 
expansion of the square-root type. 

Lemma 3.2. The sequence of enumerative series for the number of vertices at distance i from 0\ is: 

D 1 {z) = zT\z)/{\~2zT 2 {z)), 

D 2 (z) = H(z, T(z)) x (1 + 2z 2 T i (z))/(6zT(z)(l - 2zT 2 (z))) 
and for i>2 A+iO) = H l - 1 (z, T{z)) x D 2 (z), 

where H(z,T(z)) is a rational function in z and T(z), that has radius of convergence p = 4/27, and a 
singular expansion H(z) = 1 — T^yl — z/p + |(1 — z /p) + (1 — z/p) 3 / 2 + 0((1 — z/p) 2 ). 

Proof. From ( |3.1| , it is easy to compute the expressions of D\(z) and D 2 (z), and show that D i+ i(z) = 
H(z,T(z)) x AO*), with H(z,T(z)) = 6z 2 (T(z) - l)T(z)/(l - 3z - zT(z) - zT 2 (z) + 2z 2 T 2 (z)f\ 
The singular expansion comes from expressing z as (T(z) — 1)/T 3 (z) and plugging in H the singular 
expansion ([!]) of T(z). A full expansion of T(z) yields a full expansion for H(z), the first terms of 
which are given in the lemma. □ 

The full singular expansion of D^z) can be derived from its expression in terms of H and D 2 . 
Thus the proportion of vertices at distance i from 0\, that is :r ^[z n ]D i (z) can be evaluated. We 
have no closed form to express these quantities (it is work in progress) , but plotting from experimental 
results obtained on a sample of randomly generated RANS, shows that the distances from 0\ follow 
the distribution shown in figure [3] 




Figure 3: Distances from 0\. The points represent the experimental data on RANS of orders 1000- 
1400. 



The average distance from 0\ is also obtainable by derivation of D(z,u) = J2Di(z)u l . From 
lemma 3.2 there is a closed form for D(z,u), and derivation leads (fortunately) to the same series as 
the one obtained for A(l)(z), in section 4.1 



bmce Q[z,T(z)]/(T(z) -1-T 3 (z)) is a (Q(z)-vector space with dimension three, all rational functions in z and T(z) 
that appear in this paper can be expressed in a canonical form. However we didn't use it since it usually hides the 
combinatorial interpretation of the generating functions under consideration. 
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This scries has a singular expansion around p with first term — z/ p) 2 , so for the mean distance 

^sH^-tF( 1 + <S>)- 

We thus conclude this section with the following proposition: 

Proposition 3.3. In a RANS of order n, the average distance from 0± is of order c^/n, with c = 

4 Total distance between pairs of vertices 

In this section, we are interested in computing the total distance of every pair of vertices in a RANS 
of order n, and will show that the mean value of this quantity is still of order y/n. 

We call C(R) the set of pairs of vertices (we call pair a set of size two) in R 6 1Z, excluding pairs 
where both vertices are in O(R). 

The enumerative generating function for the total distance between pairs in C(R) is 

G ( z ) = H dist(v,w) 

ReR (v,w)ec(R) 

C(R) splits into two parts 

• the pairs (v, w) such that they are both internal vertices of the smallest sub-RANS of R that 
contains both of them, corresponding to case 7 in figure [4] 

We will note them Inter(i?) and their contribution to the total distance will be called interdis- 
tance. 

• the others, which can also be defined as the pairs (u, to) such that there exists a sub-RANS S of 
R with v an outermost vertex of S and w an internal vertex of S, corresponding to cases 2, 3 
and 4 in figure |4j 

We will note them Intra(i?) and their contribution to the total distance will be called intradis- 
tance. 



A'AAAA 

Figure 4: All possible configurations for pairs of vertices in R G 1Z. For each pair in C(R) we can 
always find a unique sub-RANS of R such that the two vertices are in one of configurations 2, 4 or 
7. The other cases reduce to one of these: case 1 reduces to 2 by looking at the RANS containing R, 
case 3 leads to 2, 3 or 4 by looking in the sub-RANS containing the two vertices, and such is also the 
case for case 5. Case 6 is amenable to any of 5, 6 or 7 by looking at the sub-RANS containing the two 
vertices. 
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Remark: C(R) has n(n — l)/2 + 3rt elements: each pair of internal vertices is counted only once, 
and the 3n term takes into account all pairs made of one internal vertex and one outermost vertex. 
Among all these pairs, an amount of order n^pn belongs to Intra(i?) and the rest is in Inter(i?). As 
we will show, the total distance of pairs in Intra(i?) is of order n 2 and the total distance of pairs in 
Inter(i?) is of order n 2 y/n. We can thus say that the interdistance gives the dominant term of the total 
distance in RANS, which is of order y/n. 

We introduce in the following subsection a new generating function which serves as a basis for the 
computations of all the quantities that are needed. Then we calculate the intradistance followed by 
the interdistance. Putting everything together gives the following result. 

Theorem 4.1. The mean distance in a RANS of order n is asymptotically equivalent to Cy/n, with 
C = 

4.1 Topological generating function 

Given R £ 1Z, the distances of inner vertices to O(R) are denoted by the three following parameters: 
A ( l)(i?) = d(x, Ot(R)), A ( 2)(i?) = d(x, {0 1 (R),0 2 (R)}) and A ( 3)(R) = £ d(x, 0{R)). 

x£R x€R xeR 

Notice that A(l)(i?) is the sum of all the labels in the <5( relabeling of R, based on RANS of type 



(0,1,1) — see (3.1l. Similarly, A(2)(i?) is the sum of all the labels in the 5(2)-labeling of R, that 
starts with a RANS of type (0,0, 1); and A(3)(i?) is the sum of all the labels in the <5(3)-labeling of R, 
starting with a RANS of type (0, 0, 0). 

In the following generating function, parameter A(i) is marked by variable d(i): 



DC 



A(z, d ( l), d ( 2), d ( 3)) = £ d { l) A ^d ( 2) A ( 2 ^d ( 3) A ^ R h^ = a n,i,i,k d ( l)M ( 2)^ ( 3) fe z", 

ReTZ n,i,j,k=0 

where cx-n^j^k is the number of RANS of order n (i.e. with n internal points), and respective values 
fc for parameters A(l), A(2), A(3). This generating function is called the topological generating 
function since it expresses the distances according to the three different topological types of RANS. 

Proposition 4.2. The topological generating function satisfies the recursive equation 

A(z,d ( l),d ( 2),d ( 3)) = 1 + zd ( l)d ( 2)d(3) x A(zd ( l), d ( 2), d ( 3), d ( l)) 

x A(z,d ( l),d ( 2)d(3),l) 
x A(z,d(l)d ( 2),d ( 3),l). 

Proof. Let's follow the recursive definition of RANS R. If R is empty the contribution to the series 
is 1. Otherwise it has a center, which is at distance 1 from each of the outermost vertices (hence the 
factor zd(l)d(2)d(3)) and the contributions come from the 3 sub-RANS. 

Factor A(zd(l), d(2), d(3), d(l)) comes from Si(R). Suppose Si(R) has, by itself, a generating series 
A(z, d(l), d(2), d(3)), that corresponds to the three different labellings, with types (0, 1, 1), (0, 0, 1) and 
(0, 0, 0). When it is considered as embedded as the first sub-RANS of R, the top most vertex has label 
1 instead of 0, so that the three different labellings now start with types (1, 1, 1), (1, 0, 1) and (1,0, 0). 
Thus variable dT) transforms into d(2), variable d(2) transforms into d(3), and variable d(3) transforms 
into d(l) with a 1-translation. 
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Factor A(z, d(2)d(3), 1) comes from 52 (i?). Suppose 52 (i?) had, by itself, a generating series 
A(z, d(2), d(3)), corresponding to the three different types (0, 1, 1), (0, 0, 1) and (0, 0, 0). When it 
is considered as embedded as the second sub-RANS of R, 02(S2(R)) has to have label 1, so that the 
three different labellings now start with types (0, 1, 1), (0, 1, 1) and (0, 1, 0). Thus variables d(2) and 
<f(3) transform into di2), variable d(X) stays dfl), and nothing transforms to dr3). 

Factor A(z, d(l)d(2), di3), 1) comes from Ss(R), and the proof is equivalent. □ 

The series of cumulated distances from A is obtained by derivation 



A ( l)(z)= A ( l)(i?)^'= -^-A(z,d ( l),l,l) 
Ren ( ' 



i ( i)=i 



and the same holds for the two other cases. 

Proposition 4.3. The distance generating functions A(i)(z) have the following expressions: 

A ( l)(z) = zT{z) 3 {\ - 2zT(z) 2 + z 2 T(z) 4 - 6z 3 T(z) 6 )/Q(z, T{z)) 
A ( 2)(z) = zT{z) 3 {\ - 3zT{z) 2 + 4z 2 T(z) 4 - 6z 3 T(z) 6 ) /Q(z,T(z)) 
A ( 3)(z) = zT{zf{\ - 3zT(z) 2 + 2z 2 T{z) A )/Q{z, T(z)), 

where Q(z,T(z)) = (1 + 2z 2 T(z) 4 )(l - 3zT(z) 2 ) 2 . 

Each Ari)(z) has radius of convergence p and a singular expansion of the form: 

A ( i)(z) = 3/(44(1 - zjp)) + 0(1 - zip)- 1 ' 2 . 

Proof. The distance generating functions A/i)(z) satisfy the system of equations: 

A ( l)(z) - zT 3 (z) + 2zA { l)(z)T 2 (z) + zT 2 (z)(zT'(z) + A { l)(z)) 
A ( 2)(z) = zT 3 (z) + 2zA { l)(z)T 2 (z) + zA(3){z)T 2 {z) 
A(3){z) = 3zA(2){z)T 2 {z) + zT 3 (z) 

where T'(z) = T 3 (z)/(l - 3zT 2 (z)). 

The resolution of the system shows that each A(i)(z) has a dominant term that expresses as T' 2 (z) 
with the same constant factor, thus a pole in z = p. The singular expansions only differ on their 
second term. □ 



4.2 Intradistance 

We first consider the pairs (v, w) such that there exists a sub-RANS 5 of R with v an outermost and 
w an internal vertex of 5. There may be many embedded sub-RANS 5 and we focus on the smallest 
one, Sq. In So, vertex v is outermost (e.g. 0\) and w is either the center of So or in the sub-RANS 
opposite to v (e.g. 5i(5o)) (cf. cases 2 and 4 in figure [4}. 

We will first study the pairs Intra! (i?) , for which So — R, and then recursively extend the compu- 
tation to the rest of the intradistance. 

Lemma 4.4. The generating function for the total distance of pairs in Intrai(R), satisfies 

5{z) = 3T{z) + 3zT 2 (z)A 3 (z) + 3z 2 T 2 (z)T' \z). 
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Proof. The distance of Intrai(i?) is made of two categories of distances: 

• from the center of R to each of the outermost vertices O(R), 

• from each outermost vertex Ot(R) to all the internal vertices of its opposed sub- PLANS, Si(R). 
The distance from an outermost vertex Oi(R) to an internal vertex w of Si(R) is l+d(w, 0(Si(R))). 

Thus, the distance from an outermost vertex Oi(R) to all the internal vertices of Si(R) is |Sj(J?)| + 
A(3)(5j(i?)). Taking into account all three sub-RANS of R we have 



Theorem 4.5. The generating function for intradistances in a RANS is Intra(z) = S(z)/(l — 3zT 2 (z)) 
and the total distance between pairs of vertices in Intra(R), for R € TZ n , is asymptotically jgn 2 . 

Proof. The total intradistance is obtained by recursively computing intradistances at any level of the 
RANS. The effect of this recursion process, akin to recursive decent in subtrees of ternary trees, is to 
multiply the generating function by T'(z)/T 3 (z), that is 1/(1 — 3zT 2 (z)). The dominant term in the 
singular expansion of Intra(z) thus is 3zA(l)(z)T' 2 (z)/T(z). The total distance in Intra(i?) is obtained 
by evaluating [z"]Intra(z)/T„. □ 

4.3 Interdistance 

We now consider the pairs (v, w) such that they are both internal vertices of the smallest sub-RANS 
S of R that contains both of them. 

Since S is minimal by inclusion, v and w are in different sub-RANS of S, which we will call S v and 
S w . The shortest path from v to w passes through at least one of the two vertices of 0(S V ) H 0(S W ) = 
Frontier S w ). We can thus decompose this path in three sub-paths: from v to Frontier (S v , S w ), from 
Frontier(5V,, S w ) to w and, if these two sub-paths are disjoint, an one-edge path along Frontier(S'„, S w ). 
We will call this last part the f-edge. This decomposition is illustrated in figure [5] 

We will first compute a lower bound Inter - (ii) of the interdistance by neglecting the f-edges. This 
lower bound gives a total distance on pairs of Inter (i?), with R € lZ n order n 2 *Jn. We can also take 
an upper bound Inter -1 " (R) by forcing every path to pass from the center of R and this still gives a 
contribution of order n 2 y/n with the same factor. Counting the exact number of f-edges allows us to 
compute the following terms of the interdistance. 

4.3.1 Lower bound and upper bound 

As for the intradistance we first compute a lower (resp. upper) bound on interdistances at the topmost 
level Interi(i?) (i.e. S v , S w € S(R)) and extend it recursively for the whole RANS. 



Figure 5: The four possible scenarios for paths between pairs of vertices in Inter (i?). The three colors 
correspond to the three sub-paths, the red one being the f-edge. 




thus the expression of the generating function as stated in the lemma. 



□ 
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Lemma 4.6. The generating function for lower bound (resp. upper bound) of the total distance of 
pairs in Interi{R), 7~( z ) (resp. ~f + (z)), satisfies 

7 -(z) = 6z 2 T(z)T'(z)A ( 2)(z) 7 +(z) = 6z 2 T(z)T'(z)A ( l)(z). 

Proof. At level one, for each sub-RANS, the contribution to the interdistance is the total length of the 
sub-paths contained in this sub-RANS. Thus for each vertex v, situated in a sub-RANS Si(R), we will 
count its distance to the frontier, multiplied by the number of vertices in S^i?) and S^i?). The lower 
bound is obtained by adding all these values (parameter A(2)), and for the upper bound we consider 
that the frontier is reduced to only one of its two points (parameter AT)). Thus the expression of the 
generating function 7^ (z) (and 7 + is obtained by replacing A(2) by AT)): 

7 -(z) = 3 A(2)(5 x (i?)) x (|S a (ii)| + \S 3 (R)\) z™. Q 
Ren 

Theorem 4.7. The generating function for the lower bound (resp. upper bound) of interdistances in 
a RANS is + 

Inter~(z) — — — ^ . , Inter + (z) — — — ^ . , 

v ; 1 -3zT 2 (z) v ; 1 -3zT 2 (z) 

and in both cases the total distance between pairs of vertices in Inter(R), for R £ lZ n , is asymptotically 
Cn 2 y/n with C = V3tt/11. 



Proof. The proof is similar to the proof of theorem 4.5 The generating functions Inter (z) and 



Inter + (z) have both a dominant term in 6A(i)(z)z T u (z)/T 2 (z). □ 



4.3.2 Exact computation 

To know whether the path between two vertices (v,w) G Inter(i?) contains a f-edge it helps to know 
the distances from v and w to each of the two vertices (/1, ^2) of Frontier (S v , S w ). 
We distinguish two cases: 

• The first one when either d(v, fi) — d(v, fa) (we then say that v E £(R)) or d(w, fi) = d(w, fa)- 
In this case the path between v and w does not contain a f-edge. 

• Otherwise either v and w are both closer to the same vertex of the frontier or each one of them is 
closer to a different vertex of the frontier. In the first case there is no f-edge on the path between 
v and w while on the second case there is one. 

These two cases being equiprobable, thanks to the symmetry, it is sufficient to calculate the number 
of pairs (v, w) € Inter(i?) for which v,w £ £(R), and the number of f-edges will be the half of this 
quantity. 

Lemma 4.8. The generating function for the number of f-edges in the total distance of Inter\{R) is 

<j>[z) = I (z 3 T(z)T' 2 (z) - 2z 2 T(z)T'(z)E(z) + zT{z)E 2 {z)) 
where E(z) = £ BeK \£(R)\z^ . 

Proof. The number of pairs in Interi(i?) for which v,w (fc £{R) has generating function 

<t>(z) = 3 J2 (ISil ~ E(S 1 ))(\S 2 \ - E(S 2 ))zW. n 
Ren 
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Calculating E(z). We use a bivariate generating function T e (z,u) for RANS marked with vertices 
at equal distance from 0\ and C>2- This will be defined by a system of thirteen equations in the same 
spirit as in section |3~T| The analysis of this system is too long to be included in this abstract, but it 
leads to 



2T 2 0) V (2T(z) - 3)(3T 2 (z) - 4T(z) + 2 

and a singular expansion around p which is equivalent to x (1 — z/p)~ 1//2 . 
Theorem 4.9. The generating junction for the number of f- edges in a RANS is F(z) = </>(z)/(l 

242 



3zT 2 (z)) and the total number of f- edges in R, for R £ lZ n , is asymptotically ^Q-n 2 



Proof. The proof is similar to theorem 4.5 But in this case, each term of <p(z) gives a part of the 
dominant contribution. □ 



4.4 Conclusion 

Summing the contribution of intradistances and interdistances the enumerating generating function 
for the total distance between pairs of vertices expresses as: 

G(z) = Intra(z) + Inter" (z) + F(z), 

which has a closed form expression as a rational function in terms of z and T(z). 

We made an exhaustive study of the different parts of G(z). The contribution coming from in- 
tradistances happens to be of smaller order (n 2 ) than the contribution coming from interdistances 
(n 2 y/n). In the computation of interdistances, we first considered approximations that give a lower 
and an upper bound with the same dominant term (^^-n 2 ^/n) that is a mean distance in 2s {^ \fn. 
The study of f-edges provides an exact computation of the total distance. With this contribution of 
f-edges it is possible to express the second term in the asymptotic expression of the total distance. 
Moreover, relying on full singular expansion of all series under consideration, it is possible to give a 
full asymptotic expansion of the total distance. 
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